P-Gram: Positional N-Gram for the Clustering of Machine-Generated Messages
Similar resources
N-gram-based Machine Translation
This article describes in detail an n-gram approach to statistical machine translation. This approach consists of a log-linear combination of a translation model based on n-grams of bilingual units, which are referred to as tuples, along with four specific feature functions. Translation performance, which happens to be in the state of the art, is demonstrated with Spanish-to-English and English...
Aspect Clustering Combined N-gram for Reviews
With the increase in popularity of e-commerce, more and more customer reviews are available online, and it is usually hard to go through each of them. Latent Dirichlet Allocation (LDA) was used to mine product aspects. Considering the weakness of standard LDA when processing review text, we defined the product aspect model, and predefined aspects for different domains, and proposed aspect model combi...
N-Gram against the Machine: On the Feasibility of the N-Gram Network Analysis for Binary Protocols
In recent years we have witnessed several complex and high-impact attacks specifically targeting “binary” protocols (RPC, Samba and, more recently, RDP). These attacks could not be detected by current – signature-based – detection solutions, while – at least in theory – they could be detected by state-of-the-art anomaly-based systems. This raises once again the still unanswered question of how e...
N-Gram Posterior Probabilities for Statistical Machine Translation
Word posterior probabilities are a common approach for confidence estimation in automatic speech recognition and machine translation. We will generalize this idea and introduce n-gram posterior probabilities and show how these can be used to improve translation quality. Additionally, we will introduce a sentence length model based on posterior probabilities. We will show significant improvement...
Using N-gram based Features for Machine Translation System Combination
Conventional confusion network based system combination for machine translation (MT) heavily relies on features that are based on the measure of agreement of words in different translation hypotheses. This paper presents two new features that consider agreement of n-grams in different hypotheses to improve the performance of system combination. The first one is based on a sentence specific onli...
Journal
Journal title: IEEE Access
Year: 2019
ISSN: 2169-3536
DOI: 10.1109/access.2019.2924928